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We investigate algorithms to find short paths in spatial networks with stochastic edge weights. 
Our formulation of the problem of finding short paths differs from traditional formulations because 
we specifically do not make two of the usual simplifying assumptions: (1) we allow edge weights 
to be stochastic rather than deterministic; and (2) we do not assume that global knowledge of a 
network is available. We develop a decentralized routing algorithm that provides en route guidance 
for travelers on a spatial network with stochastic edge weights without the need to rely on global 
knowledge about the network. To guide a traveler, our algorithm uses an estimation function that 
evaluates cumulative arrival probability distributions based on distances between pairs of nodes. 
The estimation function carries a notion of proximity between nodes and thereby enables routing 
without global knowledge. In testing our decentralized algorithm, we define a criterion that allows 
one to discriminate among arrival probability distributions, and we test our algorithm and this 
criterion using both synthetic and real networks. 

PACS numbers: 89.75.Hc, 89.40.-a, 84.40.Ua, 89.20.Hh 



Introduction. One of the most important aspects of 
many networks is their navigability [lH3| , and it is often 
important to find short paths between pairs of nodes in a 
network. For example, sending packages across the Inter- 
net, attempting to spread ideas through social networks, 
and transporting people or goods cheaply and quickly all 
require the ability to find paths with a minimal number 
of steps or a minimal cost Q- Assuming that the net- 
work topology and the cost of making a step is known, 
such paths can be found easily [Hj. Unfortunately, com- 
plete knowledge of network topology (and edge weights) 
is often not available or constitutes an insurmountable 
overhead 

Despite the aforementioned difficulties, empirical evi- 
dence demonstrates that some networks can be navigated 
by using only local information [soj . A well-known ex- 
ample is Milgram's small world experiment which 
showed that short paths between individuals in social 
networks exist and that individuals are able to navi- 
gate the network without global knowledge of network 
topology. This observation was put on solid theoreti- 
cal ground more than 30 years later by Kleinberg [lo| . 
who showed that one can find short paths between nodes 
via decentralized algorithms in certain types of spatially- 



ings is their assumption that the cost of making a step 
is deterministic. In many situations, it is much more 
appropriate to model the cost as a random variable. For 
instance, varying levels of congestion on networks [l7l - [l9j 
make it unsuitable to model such costs deterministically. 
The aim of this Letter is to address this important limita- 
tion and to develop a decentralized algorithm for routing 
in stochastic networks. 

Deterministic Shortest-Path Problem (DSPP). A net- 
work G consists of a set of nodes N labeled by indices 
{ii,...,i„} (and with cardinality |A^| = n) and a set 
of edges E (with \E\ = m) labeled by ordered pairs of 
indices which indicates that there is a directed 

edge from node j to node i. We associate a weight 
Tij, representing a cost or travel time, with each edge 
(i, j). A path I with k steps is a sequence of fc -I- 1 nodes 
t — . . . , «fc-|-i} that are connected to one another via 
edges. The weight Ti of a path I is given by the sum of 
the weights of its constituent edges: 



embedded networks 11 



This work has led to a series of 
theoretical and numerical studies on routing with limited 
information [3, [l^l and on the importance of embedding 
a network into space when developing routing algorithms 

An important limitation of the above theoretical find- 



Thc shortest-path problem (SPP) aims to determine the 
path from an origin node to a target node that has the 
smallest total weight. In the DSPP, each edge weight 
is deterministic, and a path with minimal total weight is 
considered to be optimal. 

Stochastic Shortest-Path Problem (SSSP). To define an 
SSSP, we let the weights Tij be real-valued random vari- 
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ables that are distributed according to a probability dis- 



tribution function (PDF) with probabilities pij [18l-l22|. 
For example, non-deterministic travel times are a typical 
feature of transportation networks [l7| . Because this is 
our motivating example, we will use the terms "time" and 
"weight" interchangeably. In our SSSP formulation, we 
make three assumptions: (1) the random edge weights are 
independent of each other; (2) the PDFs do not change 
during the routing process; and (3) the weight incurred 
by traversing an edge becomes known upon completion 
of the step. (For example, the time taken to travel a road 
is known once the next junction is reached.) With these 
assumptions, it follows that the PDF for the weight Ti of 
a path (. to have a value t is given by the convolution of 
the PDFs of the weights associated with the path [2^ : 



k 

* Pi. 



(1) 



where the right-hand side denotes k consecutive convo- 
lutions. The probability to traverse the path £ and incur 
a weight < t is given by the cumulative distribution 
function (CDF) 



dt' Pi (t') 



Criteria. Because the edge weights are now random 
variables, we need to reconsider the concept of an optimal 
path. In particular, there is no longer a unique concept 
of optimality. For example, Frank [20[ defined a path to 
be optimal if its CDF surpasses a threshold 9 within the 
shortest time, whereas Fan et al. [3l suggested maxi- 
mizing the CDF for a given time budget r. Each of these 
criteria has a regime in which it outperforms the other. 
If the budget r available to a traveler is large, then there 
are many paths leading to almost certain arrival at the 
desired target; that is, ui> (r) « 1 for many paths t . In 
this regime, the paths are virtually indistinguishable us- 
ing Fan et al.'s criterion. However, Frank's criterion can 
easily identify the path that it deems to be optimal. In 
contrast, if the budget is small, then arrival at the target 
within the budget is unlikely; that is, U£" (t) <^ 1 for all 
paths £". In this case, Frank's criterion is not helpful be- 
cause there does not exist a path whose CDF surpasses 
the threshold. However, Fan et al.'s criterion can iden- 
tify the path with the maximal CDF for the given budget. 
With the above in mind, we define a joint criterion that 
takes advantage of both criteria. If there are paths whose 
CDFs surpass a threshold 9 within the budget r, then we 
choose a path according to Frank's criterion. Otherwise, 
we choose a path according to Fan et al.'s criterion. Fig- 
ure [T] illustrates the differences between the criteria. 

An Adaptive Algorithm. Even finding approximate so- 
lutions to the SSPP is challenging. An interesting ap- 
proach was proposed by Fan et al. , who utilized an adap- 
tive algorithm that evaluates the available information 
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FIG. 1: Comparison of path optimality criteria using CDFs 
of three paths. Fan et al.'s criterion prefers paths 2 and 3 to 
path 1 but cannot discriminate between the CDFs of paths 2 
and 3. Frank's criterion prefers path 2 to path 3 but cannot 
be applied to path 1. The joint criterion is applicable to all 
CDFs and chooses path 2 as the optimal one. 



before each step and account for the consequences of pre- 
vious decisions [l8| . They proposed building a routing ta- 
ble by considering the maximal probability to reach the 
target r from all other nodes i G iV\ {r}. This amounts to 
solving the following set of nonlinear integral equations: 



Ml (i) 

Ur {t) 



max 
1, 



Pij {t') Uj [t-t') dt' 



(2) 
(3) 



where Ui it) is the probability to arrive at node r starting 
from node i with a total time no longer than t and Ji is 
the set of neighbors of i. The node qi (t) that should be 
chosen to attain the maximal arrival probability is 



arg max 



{t') Uj {t - t') dt' 



(4) 



One cannot find analytical solutions to Eqs. ©-(HI) in 
general, but the CDF Ui (t) can be approximated using 
the recursive sequence 



it) 
it) 



max 

je.h 

1, 



P^J{t')v'^ {t-t') dt' 



(5) 



with index s and initial conditions 

z;°(i) = 0, V^e7V\{r}. 

The sequences {u| (t)} give lower bounds for the true 
CDFs. The sequences {w^ (t)} with the same recursion 
relation ([5]) but with different initial conditions 



gives an upper bound [2J|. In our numerical implemen- 
tation, we demand that the sequences converge to within 
a numerical tolerance e for all t. That is, we require that 

<(t)-<(t)<e, ViGiV 
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for sufficiently large s. 

Estimation function. Fan et al.'s algorithm is central- 
ized, as it requires a knowledge of a system's geometry to 
determine shortest paths. To build a decentralized algo- 
rithm, we define an estimation function f {i,j]t), which 
gauges the arrival probability from node j to i within 
time t. Such a function carries a notion of proximity be- 
tween nodes, and we will use it to guide travelers on a 
network. 

To define an estimation function, we proceed as follows. 
First, we embed a network in a metric space and denote 
by dij the distance — i.e., the length of the edge — between 
nodes i and j. Second, we define the network distance gij 
as the shortest distance between nodes if travelers are 
restricted to move along edges. Note that the network 
distance is distinct from the travel time. We assume that 
the network distance can be estimated from the distance 
between edges. That is, we assume that there exists a 
function h such that gij ~ h [dij ) . Such an assumption 
is implicit in all decentralized algorithms using a metric 
for guidance. Third, we note that the expected number 
of steps necessary to reach node i from node j is 
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where A denotes the mean edge length in the network. 
Fourth, in the absence of any further knowledge about 
the network, we assume that the weight i incurred by 
making a step towards the target is chosen uniformly at 
random from the weights associated with the edges. The 
PDF of i is the mixture distribution 1251 



Pit) 



and we estimate the weight of the unknown path £ from 
j to i to be 

fe 

fe=i 

We make indirect use of global knowledge by using A and 
p (t) as a characteristic length scale and PDF. In practice, 
however, one can calculate such characteristics by sam- 
pling a network (e.g., using traffic- fiow measurements), so 
direct global knowledge is not required. In other cases, 
such characteristics might even be known a priori. Using 
Eq. ([U, the associated CDF is 



dt' 



if i 



J 



We thereby use physical distance to evaluate the num- 
ber of steps between two nodes, and we assume that the 
random weight associated with each edge is uncorrelated 
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FIG. 2: [Color online] (a) A lattice with the origin given by a 
square and the target given by a star, (b) The data available 
to a decentralized algorithm before the first step. We show a 
network traveler's current node as a square and frontier nodes 
as triangles. We enclose the known subgraph Gk by a dashed 
contour. Panel (c) shows the available data before the second 
step, and panel (d) shows the available data before the third 
step. 



with the length of the edge. In the Supplementary On- 
line Material (SOM), we show that the order of carrying 
out mixtures and convolutions is irrelevant. 

Decentralized algorithm. Our algorithm explores a net- 
work by using local information, and it chooses a locally 
optimal node according to one of the criteria discussed 
above. The nodes that have been visited Ny and the 
frontier nodes A^f constitute a known subgraph Gk (he., 
the parts of the graph that the traveler has discovered) . 
Frontier nodes are neighbors of visited nodes but have 
not yet been visited themselves. The known subgraph 
includes all edges of G that are connected to the known 
nodes A^k = Ny U A^f • Importantly, naively stepping to- 
wards the node that is a locally optimal choice without 
incorporating information about the journey to date can 
trap a traveler in a dead end. Developing an algorithm 
with knowledge of Gk enables a traveler to navigate out 
of dead ends. 

In this local approach, we build on Fan et al.'s algo- 
rithm [H, 24| and apply it to Gk by changing the initial 
conditions of the sequences {wf {t)} and {wf (t)} for fron- 
tier nodes using the estimation function: 



z;°(i) = w;°(0 = /(j, r;0 , Vj G iVp . 

We initialize nodes that have been visited in the same 
manner as before. We then iterate the two sets of se- 
quences until they converge to within a chosen tolerance 
e. The traveler then moves to a successor node that is 
identified by one of the criteria, and we then update Gk. 
We reduce the remaining budget by the weight incurred 
by making the step. We repeat this process until the 
traveler reaches the target or the budget is exhausted. 
Figure illustrates this routing process on a lattice. 

We require [t] to provide upper bounds for the 
CDFs. In centralized routing, such an upper bound is 
equal to 1 because the target is part of the network under 
consideration. In our decentralized situation, the upper 
bound for the CDF of a node in Gk is 

/max(0 = max [f{r,j;t)] . 
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Changing the initial conditions of visited nodes to 

accelerates the convergence of the two sets of sequences 
because it reduces the initial differences between them. 
We provide pseudocode for our decentralized algorithm 
in the SOM. 

Simulations. Noland et al. (l7j (and references 
therein) investigated traffic flow data sets and found 
that they possess lognormal travel time distributions, 
so we use this class of distributions for our simulations. 
To determine edge weights, we assign lognormal PDFs 
pin ifJ-, cr; t) with mean ^ and standard deviation a cho- 
sen uniformly at random in the interval [0.5,1.5]. The 
relevant mixture distribution is thus 
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da Pin {fi,cr;t). 



We consider four different tests: we use Fan et al.'s cen- 
tralized algorithm and our new decentralized algorithm 
in combination with both Fan's criterion and the joint 
criterion that we described previously. 

We test our algorithm on a variant of Klcinbcrg's 
small- world network [2^. We start with a 10 x 10 
square lattice with an undirected edge between neighbor- 
ing nodes in the grid, and we also assign an undirected 
shortcut edge from each node i to exactly one other node. 
We determine the shortcut's destination node using inde- 
pendent random trials such that the probability of such a 
long-range edge is proportional to 1/ D^j , where Dij is the 
lattice distance between i and j. (In determining short- 
cuts, we discard duplicate edges.) We let dij = Dij and 
approximate the network distance by gtj ~ h (dij ) = dtj . 
The mean edge length of an ensemble of 10"^ networks is 
A « 1.421 ± 0.002. Suppose that the origin of the routing 
process is at (2, 2) and that the target is at (9, 9), where 
(x, y) designates a node using its lattice coordinates. We 
run each test 10^ times for a range of budgets (see Fig. [3]), 
a CDF threshold of = 0.8, and a tolerance of e = 10~^. 
(An error in arrival probability smaller than a tenth of a 
percent will not affect real travelers.) 

As shown in Fig. El^a), the arrival fraction — i.e., the 
fraction of routing attempts that reach the target within 
a given budget — increases with increasing budget. Cen- 
tralized algorithms know the entire network topology and 
can thus make better decisions, so they have larger arrival 
fractions. As shown in Fig.[5]in the SOM, the arrival frac- 
tion of Fan et al.'s centralized algorithm using the joint 
criterion is almost independent of the threshold. Because 
the local neighbors of any node are located in the cardi- 
nal directions, the arrival CDFs are sufficiently different 
for CDF maximization and travel time minimization to 
agree. We obtain the same results using our decentral- 
ized algorithm. Travelers thus choose the same successor 
node irrespective of the threshold 6; this results in the 
same arrival fraction. 
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FIG. 3: [Color online] Fraction of routing attempts that suc- 
cessfully reach the target on the (variant of the) Kleinberg 
network as a function of budget for a CDF threshold of S = 0.8 
and a tolerance of e = 10"'^. 



Because centralized algorithms are aware of all short- 
cuts in a network, they have smaller mean travel times 
than decentralized ones (see Fig. |4|). For small budgets 
(t < 10), the travel times of all algorithms increase with 
increasing budget. The algorithms choose neighbor nodes 
to visit to maximize the CDF, which results in longer 
travel times because it is advantageous to exhaust the 
budget. For budgets r satisfying 10 < t < 17, the travel 
times using Fan et al.'s criterion and our joint criterion 
start to differ. The joint criterion starts to minimize 
travel times in this regime until they approach a steady 
value. Fan et al.'s criterion, however, continues to max- 
imize CDFs such that travel times grow with increasing 
budget. For larger budgets (r > 17), Fan et al.'s criterion 
is unable to distinguish between the CDFs of neighbor- 
ing nodes, and algorithms using this criterion enter an 
unguidcd phase (i.e., one can construe a traveler to be 
"lost"). The algorithm steps to neighbor nodes seem- 
ingly at random until the budget decreases sufficiently 
for Fan et al.'s criterion to discriminate among CDFs. 
As illustrated in Fig. HI the travel time using Fan et al.'s 
criterion increases linearly with the budget in this regime. 

In the SOM, we discuss simulations using a network 
for the Chicago area and illustrate the use of Euclidean 
distance (in the physical space of the network) versus 
network distance. 

Conclusions. We have examined decentralized routing 
on networks with stochastic edges weights. Our contri- 
butions are twofold. First, we have introduced a new 
criterion to discriminate among the CDFs of paths. It 
circumvents the limitations of Fan's and Frank's criteria 
while retaining the desirable properties of both because it 
minimizes travel times without a de facto sacrifice of reli- 
ability. It also provides a better caricature of the behav- 
ior of real travelers [27[. Second, we have developed a de- 
centralized routing algorithm applicable to networks with 
stochastic edge weights. Our algorithm employs a CDF 
estimation function that captures a notion of proximity 
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FIG. 4: [Color online] Mean travel time of successful routing 
attempts. We separate small, intermediate, and large budgets 
using dashed vertical lines. The error bars correspond to three 
standard deviations from the mean. 



in space and guides network travelers without the need 
to incorporate global knowledge about a network. Our 
simulation results demonstrate that decentralized routing 
on networks with stochastic edge weights is viable. This 
type of approach appears to be very promising, and in- 
vestigating the limitations of such an approach and also 
the situations in which it can best succeed are impor- 
tant topics for future research. Possible improvements of 
our algorithm include the development of more sophis- 
ticated choices of estimating functions that incorporate 
edge lengths, edge weights, and their correlations. We 
expect this work to be particularly interesting in studies 
of routing on temporal networks, in which the existence 
and other properties of edges are time-dependent. 
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Pseudocode for Our Decentralized Algorithm 

In Algorithm [l] we give pseudocode for our decentralized routing algorithm for networks with stochastic edge 
weights. 



Algorithm 1 Our decentralized routing algorithm, which builds on the iterative approximation scheme developed 
by Fan and Nie 2J]. The input parameters are a network G, an origin node, a target node, a time budget r, a CDF 

threshold 0, and a Criterion to identify successor nodes. 



10: 



15: 



20: 



25: 



30: 



function DecentralisedRouting(G, origin, target, r, 6, Criterion 
(current, traveltime) <— (origin, 0) 
steps {(current, traveltime)} 
AV {current} 

while traveltime < r and current 7^ target do 
iVp *r- i e E : j £ Nv and i ^ Ny} 

Vi (t) Wi (t) ^ f {i, target; t) Vi G iVp 

/maT* (t) = max.ejvp [/ {i, target; t)] 

v° (t) ^0 V j G iVv 

w° (t) ^ fL^ir (t) V j G iVv 

unstable N\r 

while current G unstable do 

v:+' (t) = max.ej. (f) «| {t - t') dt'] 

3+1 r rt 



f^p,,{t')w^^{t^t') dt' 



\l i unstable 
Vi G unstable 



(t) — maxj 
s ^ s + 1 

for i G unstable do 

if \vt [t) - wl {t)\ < e Vtthen 

remove i from unstable 
end if 
end for 
end while 

gcurront {t) = arg max^ g j* Pcurrcnt j (*') «j {t - t') dt' 

successor <— CRlTERlON(7;currcnt (t) , ^current (t) , T — travcltlmc, 6) 

traveltime traveltime + random sample of Tsucccssor, current 

add successor to AV 

current ^ successor 

add (current, traveltime) to steps 
end while 
return steps 
end function 



t> Obtain frontier nodes. 
t> Initialize frontier nodes. 

l> Obtain upper bound. 
> Initialize visited nodes. 



> Update the sequences. 



t> Check for convergence. 



> Obtain the successor node. 
> Update the travel time. 
> Extend the known subgraph. 
> Make the step to the successor. 



Arrival Fractions Obtained by Fan et al.'s Centralized Algorithm 



In Fig. [SI we demonstrate that the arrival fraction of Fan et al.'s centralized algorithm [ISj using our joint criterion 
is almost independent of the CDF threshold 6. 



Simulations on the Chicago Network 

We also tested our algorithm using a road network for the United States city of Chicago [28| . We show this network 
in Fig. |6l 

In the main text, we claimed that there exists a function h such that the network distance between two nodes i and 
j is well approximated by h (dij). Our computations with the Chicago network will allow us to examine this claim 
more closely. Figure [7] demonstrates that the Euclidean distance 
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FIG. 5: Arrival fractions obtained by Fan et al.'s centralized algorithm as a function of CDF threshold for tolerance e = 10 
and several different budgets. The error bars in the two panels correspond to three standard deviations from the mean. 
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FIG. 6: The Chicago road network, which has 542 nodes representing junctions and 1084 edges representing roads. We show 
a path in red. 
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is correlated strongly with the network distance. Based on a linear bootstrap fit (29[, the best choice for h w g^j is 

h{d^j) w 0.547(8) km + 1.1176(4) x d^J, 

where dij has units of km. [Recall that "0.547(8)" means that the error bars place the value between 0.539 and 0.555.] 

The slope of the linear fit is larger than 1 because the Euclidean distance between any pair of nodes provides a 
lower bound for the network distance between the nodes. 

We obtain similar results when using the lattice distance 

dij = ~Xj\ + \yi - yj\ (6) 

between nodes i and j. In this case, the slope of the linear fit is smaller than 1 because the lattice distance provides 
an approximate upper bound for the network distance. Complicated paths, such as zigzag paths, can of course violate 
this approximate bound. 

The mean edge length of the network is A ~ 1.89 km. We choose the origin and target nodes uniformly at random 
such that their Euclidean distance lies in the interval [13.1, 16.4] km. We consider the same four tests as in the main 
text and run each test 10'^ times for a range of budgets, a CDF probability threshold 9 = 0.8, and a numerical 
tolerance e = 10"'^. 

We illustrate the arrival fractions as a function of budget, which reveals the same qualitative behavior as on the 
(variant of the) Klcinberg small- world network, in Fig. [HKa). As with the Kleinberg network, the arrival fraction of 
algorithms using the joint criterion exhibits little dependence on the CDF threshold 6 [see Fig. [5Kb)]. In Fig. [HI we 
show that the travel times of centralized algorithms are smaller than those of decentralized algorithms because the 
former know all shortcuts in the network. 
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FIG. 7: Probability density of Euclidean distances between nodes versus network distances between nodes. The Pearson 
correlation coefficient is p « 0.985, which justifies the linear fit shown in red. 
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FIG. 8: (a) Fraction of routing attempts that successfully reach the target on the Chicago network as a function of budget for 
a CDF threshold of 6' = 0.8 and a tolerance of e = 10^"^. (b) Arrival fractions obtained by Fan et al.'s centralized algorithm as 



a function of CDF threshold for tolerance e - 
to three standard deviations from the mean. 
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FIG. 9: Mean travel time of successful routing attempts. We separate small, intermediate, and large budgets using dashed 
vertical lines. The error bars correspond to three standard deviations from the mean. 



Mixture of Convolutions Versus Convolution of Mixtures 



Let F = {fi (x)}, G = {gj (y)} be two finite sets of probability density functions (PDFs) and let the mixtures of 
the elements of the sets be given by 

i 



10 



where w /. and cog^ , respectively, are the independent weights associated with the elements of F and G. Taking the 
mixture after performing the convolution of the elements of F and G gives 

(/» (^) = / '^s; Vw/Jj(z- a;)w<,^..gj (a;) = / dx f {z - x) g (x) = {f * g) (z) . 
^0 Jo 

Therefore, provided that the assumption of independent weights holds, it follows that mixing the result of a convolution 
is equivalent to taking the convolution of two mixtures. 

Consider b sets of probability distributions {Fi, . . . , Fi,}. Let each set Fi have q elements. Carrying out the 
convolutions of all pairs of probability distributions in the sets first and taking the mixture afterwards requires 
(ni=i '^ij convolutions and additions. However, carrying out the mixtures first and then performing the convolutions 

requires b convolutions and (^J2i=i additions. It is thus much more efficient computationally to compute the 
mixtures first and subsequently perform the convolutions. 



